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Abstract 

The traditional axiomatic approach to voting is motivated by the problem of reconciling differences 
in subjective preferences. In contrast, a dominant line of work in the theory of voting over the past 
15 years has considered a different kind of scenario, also fundamental to voting, in which there is a 
genuinely "best" outcome that voters would agree on if they only had enough information. This type 
of scenario has its roots in the classical Condorcet Jury Theorem; it includes cases such as jurors in a 
criminal trial who all want to reach the correct verdict but disagree in their inferences from the available 
evidence, or a corporate board of directors who all want to improve the company's revenue, but who 
have different information that favors different options. 

This style of voting leads to a natural set of questions: each voter has a private signal that provides 
probabilistic information about which option is best, and a central question is whether a simple plurality 
voting system, which tabulates votes for different options, can cause the group decision to arrive at the 
correct option. We show that plurality voting is powerful enough to achieve this: there is a way for 
voters to map their signals into votes for options in such a way that — with sufficiently many voters — 
the correct option receives the greatest number of votes with high probability. We show further, however, 
that any process for achieving this is inherently expensive in the number of voters it requires: succeeding 
in identifying the correct option with probability at least \ — rj requires ^{n^e^^ log r;^^) voters, where 
n is the number of options and e is a distributional measure of the minimum difference between the 
options. 
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1 Introduction 



Information-Based Voting. A dominant recent theme in the study of voting has been to trace differences 
in voters' preferences back to differences in the information they have about the world. This information- 
based approach has its roots in one of the earliest results in voting theory — the Condorcet Jury Theorem, 
which used the then-young theory of probability to model a situation in which a panel of jurors each wants 
to vote for the correct decision in a trial, but each juror may be wrong about what the correct decision is 
independently and with probability p < \ ifTTl . It is only very recently, however, that this approach has 
received deeper theoretical attention |l2l|4l|6l|7l[T6l, leading to what is now a large and growing body of 
research. 

The basic premise of the information-based approach to voting is that all voters want the best option 
for the group as a whole, but they disagree on what this best option is, based on the information they have. 
This models a wide range of situations where the differences among voters are not purely subjective, but 
instead based on uncertainty. For example, in most criminal trials the key question is genuinely whether 
the defendant committed the crime or not; all jurors want to reach the correct decision, but they disagree on 
which of the pieces of information presented at the trial are most salient. Similarly, all the members of a 
corporate board of directors may genuinely agree that the goal is to reach a decision that will most improve 
the company's future revenue, but they disagree on which course of action is most likely to achieve this. 
Even at the level of large populations, there can be cases where each voter wants a candidate whose election 
— for example — will lead to the strongest improvement in the economy, but there is disagreement among 
the voters about which candidate is most likely to achieve this. 

This view of voters as information-processing agents trying to reach a correct decision has made it 
possible to develop models for a range of important phenomena in voting; these include the fact that voters 
realize they might be wrong, and the corollary that they can sometimes be convinced by evidence IHQ, the 
corresponding role of deliberation in committee voting ifTOl . and the fact that many voters may choose to 
abstain or not participate when they believe that others have more accurate information than they do 13i[8J. 

A Basic Model of Information-Based Voting. In this paper, we consider the following basic theoreti- 
cal model that has received wide study 121171. There is a decision to be made, involving selecting from 
among several possible options Ai, . . . , An- One of these options is correct, and all voters want to select 
it. However, which option is correct is determined by a process that cannot be directly observed, and the 
voters have to use indirect signals to infer the correct option. Before casting a vote, each voter t receives 
a private signal equal to some value Sj, providing evidence for the identity of the correct option. (The 
full set of possible signals will be labeled {si, S2, • • • , sc}-) We assume that certain kinds of signals are 
more plentiful when certain options are correct, and that voters know conditional probabilities of the form 
Pr [sj is received \ Ai is correct] = pij . We further assume that no two options induce exactly the same 
set of conditional probabilities over signals. Based on the signal she receives, each voter casts a vote for 
one option, potentially using a randomized rule to map the signal to a vote. A voting system — a rule for 
mapping a collection of votes to a group decision — is then applied to these votes. We are interested in the 
probability that the group decision will be equal to the correct option Ai. 

Much of the power of this model in economics and political science comes from the way in which it sep- 
arates the signals received by the voters from the options they are voting on. This captures a basic property 
of voting in many real-life situation, including the ones described at the outset: the signals represent infor- 
mation and decision-making heuristics that the individual voters possess in their minds, while the options 
correspond to candidates or alternatives presented on a ballot. For many reasons, the institution of voting 
therefore does not (and generally cannot) consist of a simple sharing of everyone's signals. Instead, voters 
are only able to convey the information they possess in a more indirect fashion, by voting for one of the 
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given options. The crucial question is whether there is a (possibly randomized) algorithm each voter can 
apply to his or her signal to produce a vote, in such a way that the correct option is chosen. 

For simplicity in the following discussion, we consider an equivalent formulation of this model (in the 
spirit of lU), via an experiment involving urns and marbles. Suppose an experimenter has a collection of 
urns Ai, . . . , An, and each urn contains marbles of colors si, . . . , sc- The fraction of marbles of color sj 
in urn Ai is equal to pij; no two urns have exactly the same mixture of colors. Now, the experimenter 
announces to a set of test subjects that he is placing one of the urns Ai, An on a. table. Each test subject 
draws and replaces a single marble from the urn on the table, without showing it to the other subjects, and 
then writes down a vote (on a secret ballot) for which urn she believes is on the table. The experimenter 
then applies a voting rule to these votes, producing a group decision, and awards the group a prize if this 
group decision is equal to the true urn that was on the table. It should be clear that this formulation is simply 
a rephrasing of the original model, with the urns representing the options and the marbles representing the 
signals. 

Can Plurality Voting Produce the Correct Answer?. Much of the initial theoretical work on these issues 
focused on the case of n = 2 options — that is, voting when there are two alternatives, such as in a jury trial 
or a yes/no vote on a proposed rule |l2l|4l|6l|7l[T6l. But many settings involve more than two options, and 
in this case the following basic question has remained open. Suppose the votes will be aggregated simply 
using plurality voting, with the urn receiving the most votes chosen as the group decision. Is there a rule 
the voters can use for mapping signals (colors) to votes, such that for any instance of the problem with urns 
Ai, . . . , An, a set of m voters will identify the correct urn with probability converging to 1 as m grows? 
And if so, how large a set of voters is needed to guarantee a success probability of 1 — ry, for a given t] > 01 
Recent work has highlighted the challenge and general lack of understanding of this question with 
more than two options, raising it as an open problem and providing interesting results in highly structured 
special cases where the signal space is rich enough that each option has a disjoint set of one or more signals 
that uniquely favor it 1121 IT3l . For general sets of signals, the question has been open: if the signals are 
expressively weak compared to the full set of options, is there necessarily any strategy for mapping signals 
to votes that would lead to the correct outcome under a simple system like plurality voting? 

Optimal Information-Based Voting: Main Results. Our first main result is that for any finite set of 
signals, and any finite set of options that induce distinct distributions over these signals, there is a strategy 
such that a sufficiently large set of voters can arrive at the correct option with high probability using plurality 
voting. In other words, each voter translates her signal into a vote in such a way that the option receiving 
the most votes is, with high probability, the correct one. 

Second, we show that achieving this goal using plurality voting is very expensive: it requires a large 
number of voters. We give lower and upper bounds on the number of voters needed to achieve a high 
probability of correctness, parametrized by three quantities: the number of options, the number of signals, 
and a quantity measuring the minimum separation between the distributions over signals induced by any 
two options. The lower bound is the technically most involved of our results, and for two signals it is 
asymptotically tight in both the number of options and the separation parameter. 

The technical core of our results is the case in which there are n options and 2 signals. Let e be the 
minimum positive difference between the probability assigned to a fixed signal by two different options 
i and j. With two signals, we show there is a strategy by which 0(n'^e^^ logr?"^) voters can arrive at the 
correct option using plurality voting with probability at least 1 — r]. The strategy is symmetric, in that all 
voters map signals to votes according to the same probabilistic rule. While the algorithm involves a carefully 
designed rule, it is based on a principle that is intuitively natural: the voters "hedge" against the possibility 
that their information points in the wrong direction, by sometimes choosing to vote for an option other than 



2 



the one supported by their signal. The bound achieved by our algorithm is tight: there are instances in 
which il(n^e^^ log t]^^) voters are necessary to achieve such a guarantee; this lower bound applies even to 
asymmetric strategies in which different voters can use different rules. 

Note that by the pigeonhole principle, the minimum difference e is at most l/(n — 1), and hence is a 
parameter that is at least as large as n — 1. For example, the special case with urns Ao,Ai, . . . , An, in which 
Ai contains i blue marbles and n — i red marbles, has e = 1/n, and so for this problem the tight bound on 
the minimum number of voters needed is G(n^ log r/^^). 

A recurring theme in our results is this fifth-power dependence of the number of voters on n, in the case 
when e^^ is close to n. As such, it is useful to provide some intuition at the outset for how this fifth-power 
dependence arises. Thus, the following description is deliberately informal, but gives a sense for where 
this functional form comes from. Let there be m voters, and for simplicity let us consider the special case 
from the previous paragraph, with urns Aq, Ai, . . . , An, in which Ai contains i blue marbles and n — i 
red marbles. Under the asymptotically optimal (randomized) algorithms we consider, the correct urn will 
receive a greater number of votes in expectation than any other urn; this is why, with enough voters, we will 
eventually be able to distinguish the correct urn using plurality voting. Now, we will find that the optimal 
algorithm has the following two properties. First, it spreads out the votes relatively uniformly across a set of 
G(n) urns, and so if there are m voters, each of the urns in this set receives Q{m/n) votes in expectation. 
The second, subtler property is the crucial one: the optimal algorithm ensures that the conect urn receives 
the most votes in expectation using a delicate optimization under which the the expected number of votes 
received by the correct urn will exceed the expected number of votes received by the adjacent urns by a 
factor of only 1 + 5, where 5 = ©(n^^). As a result, to distinguish the correct urn with high probability, we 
need a number of samples that is sufficient to yield at least ©((5~^) = 0(n^) votes for the correct urn. But 
since the correct urn receives only Q{m/n) votes in expectation, this means that we need m to be ©(n^). 

We observe that in our more general bound 0(n^e~^ logr/^^), the form of the dependence on e^^ is in 
fact necessary even if the voters could share their signals (rather than casting individual votes). Indeed, with 
n = 2 options that assign probabilities to signals differing by only e, even a single observer would need 
to see ©(e^^ log ?7~^) signals in order to identify the correct option with probability at least 1 — rj. Thus, 
with a constant number of options, plurality voting is allowing voters to aggregate their information with an 
efficiency that is within a constant factor of the efficiency achievable by a single person who could observe 
all signals directly. 

For the case of C > 2 possible signals or colors, let e denote the minimum distance^ of two distinct 
urns' probability distributions. We have an upper bound of O ^(C log C)^n^e~^ log on the number of 
voters needed. Since the lower bound for the two-signal case applies with C > 2, it is tight in e, and we lose 
only an exponentially small factor in n. Finding the correct dependence of the required number of voters on 
n and C is an interesting open question. 

Under plurality voting, voters can only communicate the name of a single option in response to a signal. 
We also consider voting systems that allow voters to be much more expressive: cumulative voting, in which 
each vote consists of assigning a non-negative weight to each option (such that the weights sum to 1); and 
Condorcet voting, in which each vote consists of a ranking of all the options. For bichromatic urns, we 
show that cumulative voting requires only 0(e~^ log rj^^) voters in order to succeed with high probability; 
this is tight even compared to the baseline discussed above, when a single observer has access to all the 
signals. We show that a similar bound would hold for Condorcet voting, modulo an intriguing conjecture 
about distributions over permutations. 

'We observe that, in the multicolor case, choosing the right parameter to define a notion of "distance" between urns is not as 
straightforward as in the bichromatic case. We chose £i because it is the parameter that has been used in the literature to determine 
the minimum number of samples that allows an algorithm to distinguish between probability distributions. 
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Optimal Information-Based Voting: Main Teciiniques. The possibility result for identifying the correct 
option is based on a technique that implicitly draws a connection to the framework of proper scoring rules 
from statistics 1 1 1 1. Proper scoring rules can be thought of as incentive systems for eliciting accurate proba- 
bilistic forecasts from expert predictors; the contexts in which they have been used in earlier work are quite 
different from ours, and to our knowledge there have not been previous linkages between proper scoring 
rules and information-based voting. 

A construction based on proper scoring rules provides the first method for obtaining the correct option 
using plurality voting. However, we need to go beyond this construction in order to obtain a tight bound 
on the number of voters needed: in a sense to be made precise below, we can prove that any direct use of 
proper scoring rules in our setting requires at least $7(ne~^) voters to achieve a high probability of success. 
This is at least r2(n^e~^) since e < (n — 1)^^, and more significantly, it has an asymptotically sub-optimal 
dependence on e of r2(e~^) when n is a constant, whereas our stronger approach achieves the optimal 
dependence of 0(e^^ log 77^^) for constant n. 

For the lower bound, we need to show that with 0{n?e^^ log rj^^) voters, there is a probability -q that 
plurality voting will choose the wrong option. For this, we identify a natural "close competitor" j of the 
correct option i, with a very similar signal distribution, and we consider a random variable that measures the 
extent to which the number of votes for the correct option i exceed the number for this competitor j. The 
(possibly asymmetric) strategies of the voters determine the variance of this random variable, and roughly 
speaking we follow a two-pronged argument in terms of this variance. If this variance is too low, then there 
is a high chance that voters would behave the same regardless of whether the option generating the signals 
was i or j, and hence that if they are correct about i with high probability, then they would have to be wrong 
with constant probability when j is the correct option. If the variance is above a certain low threshold, on 
the other hand, then we apply a carefully tuned "anti-concentration" inequality from [9, 14] showing that 
there is a constant probability that the number of votes for i will drop below the number for its competitor j. 

Further Related Work. Finally, we mention two other recent lines of work that have also considered 
the problem faced by a set of agents trying to agree on a joint decision from a set of alternatives. Mossel, 
Sly, and Tamuz study a version of the problem in which there are two options, and each agent is given a 
probabilistic signal providing information about which option is correct ifTSll ; in contrast to our approach and 
to the work on voting discussed above, they consider a model in which agents may communicate iteratively 
over multiple rounds. Caragiannis and Procaccia consider a setting based on agents that possess utilities 
over options; within this framework, they show that simple voting rules can approximately optimize the sum 
of agents' utilities for the option that is selected IS!. 

2 An Upper Bound with Two Signals 

We begin by considering the case of two signals. Suppose we have a collection of n urns, labeled pi , . . . , pn, 
the z-th of which having a pi fraction of blue balls and a 1 —pi fraction of red balls, with pi < P2 < • • • < Pn- 
We let e denote the smallest difference between two consecutive pj's: e = mino<j<n_i {pi+i — Pi) ■ 

We assume that one urn is adversarially chosen as the correct one (we will also refer to this as the 
unknown urn). Then each player draws a ball from the urn and votes for the name of an urn based on the 
color they observe. 

We describe the strategy that the players will use to randomly choose which vote to cast: 



fe-i 



2 - + p() 



n-l 

and r-fc = ^ 

l=k 



1. Let6fc = ^ 

1=1 



Pi+i + Pi 



. Then define R = Ylk=i rk and B = YJk=i ^k. 



Pt+i - pt 



Pi+i - pe 
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and set M = max(i?, B). 

2. The probability that a voter will vote for pj if a red ball is drawn is Rj = ■ (r-j + ^^^' ) ■ 

3. The probability that a voter will vote for pj if a blue ball is drawn is Bj = ■ {bj + '^~^ ) ■ 

It is easy to check that the two given distribution are indeed probability distributions (their values are 
non-negative and they both sum up to one). Now, the probabiUty that a player will vote for pj given that the 
correct, adversarially chosen, distribution is pi, is 

Pr [P=p,]=pi Pr [P=p.] + (l-p,) Pr [P = p^] = p^B, + (1 - p,)R, = EiiJ). 
X~(pi,l-pi)' P~/(blue)' 'p~/(red)' 

P~/(X) 

Now consider two urns, pi and pj. We compute the difference between the probabihties that a vote for 
urn Pi and a vote for urn pj are cast, given that the correct urn is pf. 



Ai{j) = Ei{i) - Ei{j) = Pi {Bi - Bj) + (1 - Pi) {Ri - R 



'3) 



We will lower-bound Ai(j) to bound the number of voters needed to let the voting scheme be successful 
with high probability. Suppose first that j < i; then 

A /■\ 2 - {p(,+i + p(,) ^ pi+i +pi ^2 -Pi - {pi+i + Pi) 
M ■ Ai{j) =Pi-y_^ (1 - Pi) • > = > , 

^ Pi+i-Pi f^jPe+i-Pi ^ Pe+i-Pe 

observing that in each term of the sum we have pi > pe+i, since £ < i — 1. Therefore, 

2p^+i - 0^(+i + Pe) ^^Pe+i- Pe 

l=j 

If, instead, i < j we have: 



P^+i -Pt T~ Pi+^ ~ Pi 



M ■ Ai{j) = -pi.\ + (1 _ p.) . \ 

Pi+i -pe, pi+i - pe 

^ {Pt+i + Pt) - "^Pi y {pe+i + Pe) - ^Pe ^ Pe+i - Pe ^ 
^ Pe+i -Pe ~ jz'i Pi+'^ ~Pi i^i Pi+^ ~ Pi 

where the inequality follows from pi < pe. Therefore for j ^ i, we have 

Mj) > (1) 

We now give an upper bound on the probabiUty that the correct urn will be chosen by a voter. Note, 
somewhat counter-intuitively, that the probability of a correct vote is higher when this upper bound is smaller 
— this is because the Aj(j) are additive gaps, not multiplicative ones, and so by making the upper bound on 
the expected number of votes for the correct urn smaller, the gap Aj(j) becomes larger relative to the mean. 

Recall that the correct urn is pi. We upper-bound the probability that a vote will go to pi: 
m = Pi ■ . {b, + ^) + (1 - Pi) . . [r. + ^) 
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i-1 



Observe that, by the definition of e, we have that bi 

n-l 



- — (^^+1 ~*~ satisfies h < ^, and furthermore 



that r, 



mi +Pt ^ 2(n - z) 



< 



Pt+i - pe 



Thus, 



Eiii)<Pi-(^ + l]+ii-Pi 



2(n-i) 1\ 2n 1 

-^^ + - < h -. 

eM n ~ eM n 



We now give an upper bound on M. This will allow us to apply a Chernoff bound and finish the proof. 
Recall that M = max(i?, B); we will upper bound R + B to get an upper bound on M: 

n n n—1 _ 



R + B = ^(rfc + bk) < ^(ri + bn) = n-{n + bn) = n-J2 



< 2 



n(n — 1) 



k=l 

It follows that 



k=l 



pe+1 - pe 



M < 



2n(n - 1) 



(2) 



Therefore, going back to the probability that an urn identical to the correct urn is voted for, we have 

2n 1 2n 1 2n(n - 1) 4n 

Ei(i) < h-< \ ^ < . 

^ ^ - eM n- eM n Me " eM 

Furthermore, since Ai{j) > for each j ^ i, we have that the um pi is the most likely urn to be voted for, 
and therefore Ei {i) > K 

We are now ready to state the main theorem of the section. Its proof employs a careful application of 
the Chernoff bound, and the inequalities we have derived in this section. 

Theorem 2.1. Let urns pi,P2, ■ ■ ■ ,Pn be given, with um pi having a Pi fraction of blue balls, and al — pi 

fraction of red balls. Let < pi < P2 < • • ■ < < 1- Also, let ebe e = m.mi<i<:n-i{pi+i — Pi)- Then, 
for Plurality Voting, O (n'^e~^ In ry~^) voters are sufficient to guarantee a probability of at least 1 — r] that 
the correct urn receives the most votes. 



Proof. Observe that e < Choose some ry G (0,1), and suppose the number of players is 



m 



108 



M(n-l) 



— we will show that m players wiU be enough to choose the correct option with 



probability is at least 1 — r/. Observe that, given our upper bound M < 



2n(n-l) 



, we have 



m < 



216 • 



(n — l)^n' 



Recall that we say that the players lose if an um with a different distribution from the unknown um 
wins the election. We will upper-bound the probability that the players lose, using the following form of the 
Chemoff bound: 

Theorem 2.2 (Chemoff bound). Let Xi, . . . , Xm be independent 0/1 random variables with expectation 
E[Xi\ = Pi, for i = 1, . . . ,n. Let ji = YliPi- Then, for each 5 >0, it holds that 



Pr 



/ min (5, (52) ^ 
< exp ^- ■ p 
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and, 



Pr 



< exp ( -— • fi 



We now show how to use Theorem I2.2[ together with the bounds derived in Section I5.1[ to prove 
Theorem 12.11 Let Vj be the number of votes to pj in the random election, with unknown urn i. Then, 
E[Vj] = Ei{j) ■ m. We have. 



Pr[the players lose] = Pr[pj did not collect more votes than any other urn] 



< Pr 



Vi < E[Vi] 



m 



3m] 



i=o 



Vj > E[Vi\ 



2m 
3M 



Since Aj(j) > we have E[Vi] > E[Vj] + ^ • m, and 



Pr[the players lose] < Pr 



< Pr 



V^ < E[V^ 1 



m 



V, < E[Vi] 1 



3ME[Vi]) 

m \ 
3ME[Vi]) 



n 
j=0 



% — 7 ^TTl \ 



V, > £|V,] 1 + 



\t — j\m 
3ME[Vj] 



< 



exp 



m 



27 M^E[Vi\ 



+ 2 exp ( — mm 
k=l 



( km k'^m'^ | 
\9M' 27 M'^E[Vj] J 



by E[Vj] < E[Vi] < m ■ 



tM 



Pr[the players lose] < exp 

< exp 

< exp 



m^eM 



108 M2 



exp I — mm 



m e 



108 M (n 



— ^ + 2 ^ exp ^- mill | 



k'^m^eM 



( km 

\9M' 108M2m(n-l) 
k'^me 



( km 

9M' 108 M(n - 1) 



m e 



< exp 



108 M(n 
m € 



k'^me 



108 M(n - 1) 



108 M(n 



-]+2X:exp 

'^^ fc=i 

^)+2|:e.p(-*=lnl) 



mm 



12 (n - 1) 



< 



< - + 2 • 

4 4 - r/ 4 



< ^ + 2 • ^ < r/. 



It follows that if m = ©(n^e ^ log r/ ^), and voters apply the aforementioned voting scheme, the probability 
of winning is at least 1 — tj. □ 



3 A Connection to Proper Scoring Rules 

In this section we discuss the connection between our upper bound and the notion of a proper scoring 
rule We first show how to obtain a strategy for a set of voters in the two-signal case using proper 
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scoring rulesH We then show that basing a strategy on proper scoring rules cannot lead to an asymptotically 
tight result: any voting strategy based on the functions arising from the framework of proper scoring rules 
requires at least Q, (ne^"^) voters. This is weaker than the upper bound of 0{n^e^'^ log rj^^) that we obtained 
in Section [2]in two important respects. First, by the pigeon-hole principle, e < and therefore approach 
from the previous section is always at least as good as the approach based on proper scoring rules, and often 
much better. More significantly, when n is a constant, the approach via scoring rules gives a dependence 
on e of 0(e^^), whereas our approach from Section |2] gives 0(e~^), which is optimal even if the group 
of voters could directly share all their signals. (In other words, even if there were just a single voter who 
received all the signals.) 

For our purposes in this discussion, it is not necessary to introduce the full theory of proper scoring 
rules, but just to provide a self-contained consequence of that theory. The consequence is the following: it is 
possible to construct pairs of non-negative functions (/o, /i), each defined over the interval [0, 1], with the 
property that for all z G [0, 1], the function 

g,{x) = zfoix) + {l-z)fi{x) (3) 

is uniquely maximized at x = z. We will further assume that /o and fi each have continuous second 
derivatives, which is true of the standard functions that arise from this theory. This defining property of /o 
and /i is all we will need. 

From a pair of such functions, here is how we can define a strategy for each voter in the two-signal 
case. We have a set of n + 1 urns, where urn i has a probability pi of producing a blue ball. We define 
qo = fo{pi) and qi = J2i fi{Pi)' ^^id l^t Q* = max(gi, go). Now, when a voter draws a blue ball, they 

y / \ * 

vote for urn i with probabihty proportional to — H ; -; if they draw a red ball, they vote for urn 

q* q*{n + l) 

i with probability proportional to — i -. We call this the strategy induced by /o and fi. 

q* q*{n + 1) 

Suppose the true urn is t; then the number of votes for an urn j is a random variable Xj = Xj^, 
where Xj^ is the indicator variable that voter v votes for j. With k voters, we have 

E = —iPtfoiPj) + 1 - Pt)fi{Pj)) + = —9pt (Pj) + w ^ IX ■ (4) 

g* g*(n + lj q* q*[n+l) 

By the defining property of /o and /i, we see that E [Xj] is uniquely maximized at j = t. Hence for a 
sufficiently large set of voters, the number of votes received by urn t will exceed the number received by all 
other urns with high probability. 

Thus, the strategy induced by any proper scoring rule will produce the true urn with high probability 
when there are enough voters. It can be viewed, in a sense, as a much simpler version of the construction in 
Section |2j and we now show that this simpler approach results in asymptotically larger number of voters. 

Theorem 3.1. Let /o and fi be any functions with continuous second derivatives that satisfy the defining 
property of proper scoring rules from Equation ((21). Then in order for the strategy induced by /q and fi to 
identify the true urn with high probability, there must be $7(ne~^) voters. 

Proof. We start with a basic claim about sums of Bernoulli trials. Let X = ^(^i Xi be a sum of independent 
0-1 random variables, where E [Xi] = pi < |. The mean of X is /i = Yli=iPi- Then with constant 
probability, X will deviate by at least a constant multiple of VVar X from fi. More concretely, there are 
absolute constants a > and /3 > so that with probability at least a, we have X < fj, — /3\/Var X. Now, 
since 

VarXj =pi{l -pi) > pi/2, 



^We are grateful to Bobby Kleinberg for identifying this connection between voting strategies and proper scoring rules. 
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we have 

k k 

VarX = ^VarXi > J^Pi/^ - '"Z^" 

i=l i=l 

Now, for a given 6 > 0, suppose we have fi < /3^/ {2S^). Then equivalently, S < P/s/^, so 



5fi < /3vW2 < pVYarX. 

Hence with probability at least a > 0, we have X < (1 — It follows that in order to ensure X > (1 — 
with probabihty going to 1, we must have // > 0(1/5^). 

Now, recall that there are k voters, and consider the voting strategy induced by the functions /o and /i. 
Since the first derivatives /q and /{ are continuous functions defined over the compact set [0, 1], there is a 
constant ci such that |/o(a;)|, < ci for all x G [0, 1]. For the same reason, there is a constant C2 

such that l/o (x)!, |/f < C2 for all x G [0, 1]. Using the bound on the first derivative, for any 7 > we 

can find an interval [u, v] C [0, 1] such that the following hold: (i) d = inf min(/o(x), fi{x)) > 0, (ii) 

xe[u,v] 

v/u<l + 7, (iii) (1 - u)/{l ~v) <l + j, 

(iv) sup — — < 1 + 7, and (v) sup — — - < 1 + 7. 

x,y£[u,v] J0\X) x,y£[u,v] /iV-^J 

It follows that if our probabihties po,pi, . . . ,pn sAl lie in this interval [u,v], then 

"(i-7i)A; (i + 7i)^" 



E[Xj]e 



n n 



for a constant 71 that goes to with 7. Also, we have q* > dn. 

Now, for any e > 0, we choose Po < pi < ■ ■ ■ < Pn ^ [u-,v\ such that Pj+i — Pj = ^ for each j. Let t 
be the true urn, and let 

hix) = ^{ptMx) + {l-Pt)h{x)) + ^^^l^- 
Notice that E [Xj\ = h{pj). Now, Taylor's Theorem impUes that for some w G \pt,pt+i], we have 

h{pt+i) = h{pt) + {pt+\ -Pt)h'{pt) + ^(Pt+i -pt)^h"{w). 

Since h{x) has its global maximum atx = pt, we have h'{pt) = 0. Moreover, since |/o (a;)|, |/('(a;)| < C2 
for all X G [0, 1], we have 

h"{w) = ^{ptfl^ix) + {I-pMx)) < ^. 



Writing pt+i — Pt = e, we have 
Since E [Xj] = h{pj), for all j, this implies 



Kpt+i) > hipt) - 



kc2 2 



E[Xt^i]>E[Xt]-^e 
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Since E[Xt] > — — this implies that E[Xt+i] > (1 - 6)E[Xt], where 6 = cae^ and C3 = 
n 

2(l-7i)(i- 

Now, using our initial fact about sums of Bernoulli trials, we must have E [Xt] > r2(l/(^^) in order for 

Xt to have a high probability of exceeding (1 — 6)E [XA. Since E [Xt] < , this requires 

n 

(l+7i)fe > d2 



for a constant ^2 > 0, and hence 



□ 

4 A Tight Lower Bound for Two Signals 

In this section we give a tight lower bound that confirms the optimality of the voting scheme for two signals 
presented in Section |2l We will start by introducing a class of instances. We will then prove a combinatorial 
lemma on how certain parameters of any (asymmetric) voting system for these instances have to behave, 
and we use the lemma to prove the lower bound. 

We start by defining the lower bound class of instances X(n, e), for any n > 2 and e < The n urns 

inX(n, e) are such that pi = ^ \- {i — l)e> for i = 1, . . . , n. Then < pi < p2 < • • • < Pn < 1- 

Each voter t is defined by two probability distributions R2,t: ■ ■ ■ , Rn,t), {Bi^t, -62,4, . . . , Bn.t)'- if 

she draws a red (resp., blue) ball she will vote for urn i with probability i2j j (resp., Bi^t)- 

Given a voting scheme for m voters (that is, 2m probability vectors {Ri,t), (Bi^t)), we define Bi = 
■ J2u=i Bi,t and Ri = m^^ ■ J^'^i Ri,t, for i = 1, . . . ,m. Thus the expected number of votes Ei{j) 
to urn j, if i is the correct urn, will be equal to m • Ei{j) = m ■ {pi ■ Bj + (1 — Pi) ■ Rj) ■ We also define 
Aj(j) = Ei{i) — Ei{j) to be the expected difference between the number of votes to i and j, if i is the 
correct urn, averaged over the m voters. 

We say that a voting scheme is proper if Aj(j) > 0, for each The challenge in proving the lower 
bound lies in the fact that proper voting schemes can succeed in identifying the correct urn for what seem 
to be a variety of different reasons, and so we need to find a common property they have which implies that 
the correct urn only "narrowly" wins the election over other urns with very similar distributions. This is the 
content of the following lemma. 

Lemma 4.1. Let n and ebe n > 10 and e < Then all proper voting schemes for X(n, e) satisfy: 

(a) Bi<B2<-- - <Bn<landl>Ri>R2>--->Rn; 

(b) E,ii) < lfori = l,...,n. 

(c) There exists a set S CI [n] and l G { — 1, +1}, with \S\ > ^ — 31nn — 14, such that for each i ^ S, 
we have 

max{\Ri - , \Bi - Sj+J) < ei • VJ^iZ^^ 



n2 

and 

7 



A,(i + 0, Ai+,(i) < 2e5 . e . ^ 



n2 
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The crux of the lemma is to show that for many pairs of urns + l, the election will be very "close": if 
i is the correct urn, it does not win the election by a large margin over i + tin expectation (and vice versa). 
The lemma shows further that, averaged over the voters, the difference between the probability of voting for 
i given a red (resp., blue) ball and the probability of voting for z + i given the same color is small. 

This upper bound is crucial for the proof of the lower-bound theorem, stated next: we will show that 
— even if we only cared about urns i,i + l — the variance of a voter's choice can be lower-bounded by 
Q.{\Ri^t — Bi^t\)- This, assuming that the total variance is at least some constant, will allow us to apply 
an anti-concentration inequality to show that the expected margin Aj [i + l) of urn i over urn i + t will be 
surpassed by G (inr/^^) standard deviations of the number of votes to urn i and i -|- 1. It will follow that with 
probability ^1(7?) the election will be won by the wrong urn. Again, this argument requires that the variance 
be at least some sufficiently large constant; if the variance is actually smaller than this constant, we will use 
a different argument showing that the voting system is sufficiently "inflexible" that if urn i wins when it is 
correct, the same pattern of votes is likely to also arise — favoring i — when i + i\s actually correct. 

Proof. We will first show that in a proper voting scheme, for each i < jii holds that Bi < Bj and Ri> Rj. 
This implies Bq < Bi < ■ ■ ■ < Bn-i and Ro > Ri > ■ ■ ■ > Rn-i- By contradiction, 

• if Bi < Bj and Ri < Rj, then Ek{j) > Ek{i) for each k: in particular for k = i, which would give 
Aj(j) < 0, contradicting the propemess of the voting scheme; 

• the same argument gives a contradiction if Bi > Bj and Ri > Rj (choosing k = j); 

• finally, assume B^ > Bj, Ri < Rj, and that Ei{i) > Ei{j), Ej{i) < Ej{j); then, 

Ei(i) - Ej{i} > EiU) - Ejij) 
{Pi - Pj)Bi + [pj - Pi)Ri > {pi - Pj)Bj + {pj - pi)Rj 
{Pi - Pj){Bi - Bj) > {pj - Pi){Rj - Ri), 

then, by pi < pj, we have 

{Rj - Ri) + {Bi - Bj) < 0, 
which is impossible since the left-hand side is positive by Bi > Bj and Rj > Ri. 

It follows that Bi < Bj and Ri > Rj. We now show that -Bn < | (resp., Ri < f ). Since {Bi}'1^-^ and 
{Ri}f^i are probability distributions, one has that \ {i \ Bi + Ri < | } | > §, for otherwise 

n n n ^ 

i=l i=l i=l 

Since P|^n+ij > (resp., < ^^), it follows that there exists some i such that Bi + Ri < and 

Pi > ^ (resp.. Pi < ^). Observe that Ei{i) < -. Now, by contradiction, let Bn > ^ {Ri > ^); by n > 
10 we have e < < |; therefore > | (pi < |) one has Ei{n) >Pi-Bn>^ {Ei{l) >Pi-Ri> ^). 
It follows that Aj(n) (Aj(l)) is negative, contradicting propemess. 

We define 5i = \Ri ~ Bi\. Then (5„ < B^ < ^ and Si < Ri < ^. Let kr be the largest integer such that 
Rkr+i ^ Bk^^i, and fc^ be the largest integer such that Bn-k^ > Rn-kf Observe that (a) if i < kr then 
Si > (^i+i, (h) if i > n — kii + 1 then 6i > 5i-i, and (c) kr + kh > n — 2. By (c) at least one of kr and ki, 
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has to be at least § — 1- We let Sr and S's be 

5/? = |i I Ri+i > Bi+i A max {Ri - Ri+i,Bi+i - Bi) < e^'^ ■ ^^3/^ | , 

SB = ^i\ B,^i > A max - Ri, B, - Bi_i) < e^^ ■ | . 

Then 5/? C {1, 2, ... , kr} and Sb {n — kb + l,n — kb + 2, . . . ,n — 1, n}. We consider two cases: 
• suppose kr > ^ — 1. We relabel the element in Sr = [kr] — Sr, using r = \Sji\: 

Sr = {H,'i'2, ■ ■ ■ ,ir}, 

with ii < i2 < ■ ■ ■ < ir- We have Si > 62 > ■ ■ ■ > Sk^ > Sk^+i- Then, for 1 < t < r — 1, 

< < - e'" . = _ . ^ = ... . (1 - ^) , (5, 

and <5i, < f < ^; we define = e^"'^ so that < oq • n^^. Let ^0 = 1 and 4 = [n • e-**+"/'] , 
for A; > 1. We also let L{k) = J2'j=o ^j- We will show by induction on k that < ak • = 
g3-fc . j^-i jj^g g^gg A; = has already been verified. We assume A; > 1. Then, 



fc+3 ■ 



gfc+3 \ / e 2 



fc+3 ■ 

e 2 



^(fc+3)/2 



Now, if A; > 11 + 3 Inn, we have Si^^^,^ < ■ e ^; by we would then get < ^i^^^,^ ■ 

(1 — e) < — since > 0, by ir < kr, this implies that r = \Sji\ < L ([11 + 31nn]). We now 
upper bound L{k) to get an upper bound on r = \Sr\: 



k 

fc+3 
2 



j=0 j=l 

k 00 

= A; + 1 + n • ^ e"^ < A: + 1 + n • e"'/' • ^ e"''/' 

= A: + 1 + n • e-"'^ ■ ^ = A; + 1 + — < - + A; + 1. 

It follows that r = \'Sb\ < L([ll + 31nn]) < f + [11 + 31nn] + 1 < f + 31nn + 13, and 
therefore 

\Sr\ >kr-- -31nn- 13 > - -31nn- 14. 
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• Otherwise, kr < ^ — 1 and therefore A:;, > ^ — 1. A proof similar to the previous case gives 

\Sb\ > /cfc - - -31nn- 13 > - -Sinn -14. 

Therefore, at least one of Sr and Sr has cardinality at least | — 3 In n — 14. If Sr is the largest one we 
pick L = 1 and S = Sr. Otherwise, we pick l = —1 and S = Sr- Observe that the choice satisfies the first 
requirement of point (c) in the statement. 

We now prove the second requirement of point (c). Let i be an element of S, (3 = Bi — BiJ^^, and 
p = Ri — Ri+L. Then, |/3| = — i/3 and \p\ = bp. Also, 



\p\ < 62 • ^ 



3 



Recall that Ai(i + i) = Ei{i) - Ei(i + i) = ppi + p{l - pi) and = Ei+,{i + l) - Ej+iji ) = 

—(3pij[-^ — p{l — Pi+l). Suppose that at least one of Aj(i + i) and Ai+t(i) is larger than 2e^e ^ '3 — -. By 
the propemess of the voting system, we would have: 



Ai{i + + Ai+,(i) > 2e2 • e • ^^^^ - 



j3{pi - pi+,) + p{pi+, - Pi) > 2e2 • e 



n2 

7 



by the definition of the instance, we have that pi+t — pi = i ■ e, therefore we would have 



+ \p\ > 2e 



V\Ri - B, 

3 

n2 



7 / \J^- _g . I 

which would imply that at least one of and \p\ is larger than ea • '3 — -, a contradiction. It follows 
that both Ai{i + l) and Aj4.t(i) are less than or equal 



n2 



□ 



Theorem 4.2. There exists a positive constant H such that for any rj < H, one has that any voting scheme 
forZ(n, e), with n > 120 and e < x^^^^i) using at most O log voters will fail to win the election 
with probability ^l{r]). 

Proof. Take any asymmetric voting scheme for X(n, e) with m voters — that is, a sequence of m vectors 
t, • • • , Rn,t) and {Bi^t, ■ ■ ■ , Bn^t), for 1 < t < m, such that the probability that the tth voter votes 
for the ith urn if she draws a blue (resp., red) ball is Bi^t (resp., Ri^t)- Let Bi = m^^ ■ Ylu=i ^i,t and 
Ri = • Ylt^i Ri,t- 

If the voting scheme is improper, then by definition there exists i,j such that Aj(j) < 0. Otherwise, by 
n > 120, one has ^ — 3 In n — 14 > 1, and by Lemma I4TT1 there will exist two urns i and j £ {i — l,i + 1} 



such that A,(j) < 2eh^^^\^. 



13 



Given we define tiie liead-to-liead (i, j)-voting process as follows; for each voter t, the random 
variable Xt = Xt{i,j) will be defined as 

1 if voter t votes for urn j, given that the unknown urn is 

Xf = i 1/2 if voter t does not vote for urns i or j, given that the unknown urn is i, 
if voter t votes for the unknown urn i. 

Observe that X = Ylt^i Xt>Y number of votes to urn j is not smaller than the number of votes to 

the right urn i. In this case, the voters will lose the election. Furthermore, 

E[X] = ^-l^-^i{3)- 

Since X is the sum of independent random variables, we have that Var[X] = Y^^i Var[X(]; by e < 
]j(n — 1), we have that Yi < pi < P2 < ■ ■ ■ < Pn < ji- We will use that pi,l — pi > \ for each i, to 
lower-bound the variance of Xt. 

Var[Xt] = {pi ■ Bi,t + (1 - Pi) ■ Ri,t) ■ (0 - E[Xt]f + (k • + (1 - Pi) ■ Rj,t) ■ (1 - E[Xt\f 

+ {pi ■ (1 - Bi^t - Bj,t) + (1 - Pi) . (1 - Ri,t - Rj,t)) ■ Q - E[Xt]^ . 

We consider two cases: 
• ifE[Xt] > i,then 

Var[X,] > {p,B,, + (1 - p,)R,,) ■ (0 - E[Xt])' > ^l£±^ . 1 > 1^^'* " 



4 16 - 64 

if E[Xt] < 3, we have 

Var[Xt] > {piBj^t + (1 - Pi)Rj,t) (1 - E[Xt])^ 

+ {p,{l - B,^t - B,^t) + (1 - Pi){l - Ri,t - Rj,t)) (I - E[Xt] 



^ PiBj,t + {l-Pi)Rj,t ^ Pi{l - Bi^t - Bj,t) + (1 - Pi){l - Ri,t - Rj,t) 



16 16 
_ Pi{l-Bi,t) + il-Pi)il-Ri,t) 
16 

The latter is equal to both ^pi{Ri^t - Bi^t) + (1 - Ri,t) and ^ (1 - Pi){Bi^t - Ri,t) + (1 - ^i,*); we 
can therefore get a lower bound of 

Var[Xt] > 4 • min(pi, 1 - pi) ■ \Ri,t - i^^l > ~ 



It follows that Var[X] = Y."U Var[Xt] > ^ ■ Ei=i 1^*,* " B^,t\. 

Recall that m ■ Ri = Ylt^i Ri,t and m ■ Bi = Ylt^i Bi,t- Suppose Ri> Bi, then 

m m 

m-\Ri- Bi\=m- {Ri - Bi) = m ■ ^{Ri,t - Bi^t) < m • ^ \Ri^t - < 64 • Var[X]. 

t=i t=i 



14 



If, on the other hand, Bi > Ri,we have 

m m 

m-\Ri-Bi\=m- {Bi - R^) = m ■ ^^(^i.t - Ri,t) < m • ^ \Bi^t - Ri,t\ < 64 • Var[X]. 



t=i t=i 



Therefore, in any case, we have Var[X] > 

We now give a different lower bound on Var[X], that we will use to deal with the case of very small 
variance Var[X]. Let pi^t,Pi/2,t^Po,t be, respectively, the probabilities that Xt = l,Xt = ^ and Xt = 0. 
Then, E[Xt] = pi,t + \ ■ Py2,u and 

\Bx[Xt] = pi,t ■ {E[Xt] - lf+pi/2,t ■ {E[Xt] - +po,t • {E[Xt]f 

We consider three cases: 

• if pi,t = max (pi,t,pi/2,t,Po,t) > \ then if E[Xt] < | we have 

Var[X,] > p,, . {E[X,] - 1)^ > L ^ = _L > 

If instead E[Xt] > f , then 
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If Po,t = max (pi,t,pi/2,t,Po,t) > 5 then we employ a similar approach. If E[Xt\ > | we have 



2^1 1 1^1- Po,t 



Y..[X,]>p,,.{E[X,]Y>-.-^=-> . 
liE[Xt] < |,then 

Var[X,] > py,, {e[X,\ - i)' {E[X,] - 1)^ > ^ + ^ > i-^i. 

• If Pi/2,t = ™^ (pi,t,Pi/2,t,Po,t) > 5, then i < < |, and 

Var[X,] > PI,, {E[X^] - if+p,, {E[Xt]f > ^ + ^ > 1^^. 

In each of the three cases, we had Var[X,] > '-^Hv^^-^y.,.^^^ ^ therefore 

Var[X] = ^ Var[Xt] > 7^ • ~ ™^ {Pu,Py2,uPo,t)) ■ 

Let us now assume that Var[X] < ^ ■ logg i. We will deal with the case Var[X] > ^ • logs ^ later. 
The previous inequaUty then impUes 

m 2 1 

^ (1 - max (pi,t,Pi/2,t,Po,t)) < o logs -• 
t=i ^ ^ 
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Recall that X = X{i,j) > ^ iff the unknown urn i gets at most as many votes as j (and therefore the 
election is lost). In the following we will also consider X' = X{j, i); we have that X' < ^ iff urn i gets at 
least as many votes as the unknown urn j (this also implies that the election is lost). 

Observe that, since ^ < pi < P2 < ■ ■ ■ < < no matter what the unknown urn is, the probabiUty 
that any specific voter votes for any specific urn changes by a constant factor (between | and |) if one 
changes the unknown urn. 

We now show that, given that Var[x] < ^ ■ logs then with probability at least v/9 each voter will 
vote according to its maximum probability choice: that is 

Pr[Vi, Xt equals the value xt that maximizes PT:[Xt = xt]] > -. 

If these choices let an urn different from i win the election, we have proven the theorem. Otherwise, we 
show that — if we exchange the unknown urn with any other urn k — then still with probability at least v/25 
each voter t will vote for the same urn xf, implying either a tie at the top, or that i (which would then not be 
the correct urn anymore) would will the election. 

We let St denote the sum of the two minimum probabihties in {pi,t,Pi/2,tjPo,t}', that is sj = 1 — 
max (pi,t,Pi/2,t,Po,t)- Observe that < 1 — | = | for each t. If we define s = Ylt^i ^t^ we also have 
s < I logs r?-^ 

We have, 

m 

Pr [Vt, Xt equals the value xt that maximizes Pr[Xt = xt]] = JJ(1 — 

t=i 

We now lower-bound the product, using the following greedy algorithm: take one of the largest s^/ < |, 
and one of the smallest St > 0, with st 7^ Sf. Then move x = min (| — st', st) > mass from st to 
Sf. Observe that the sum s of the sts remains constant throughout the process; furthermore the product 
nL=i decreases: indeed, consider the product of st ■ Sf before and after the change — we can disregard 
the rest since it remains constant. Let st, sti be the two values before the change, and st — x, St' + x be the 
two values after the change. That product used to be st ■ Sf, and becomes st ■ Sf — x{st' — st) — x"^ — 
the latter is smaller than st ■ st' since x > and > st. Note also that at each step one of the s^'s stops 
being considered (either because it becomes equal to | or equal to 0) — therefore the algorithm terminates. 
At termination there will exist at most one st with value different from | and 0. Furthermore, recalling that 

s = Ylt^i ^t, we conclude that then there will exist exactly 275 different st's with value 2/3, one with 
value < s 



2/3 



^ < i, and all the others having null value. 



3 



Given that s < | logs V ^> we can then minimize the former probabiUty with 

m 

Pr[Vt, Xt equals the value xt that maximizes PT[Xt = xt]] = — st) 



t=i 

> 3^ 

> 3-riog5r,-i]-l 

-9 

1 1 ri 

- 9 ' - 9 

If these sequence of votes guarantees that the unknown urn i loses the election, we are done. Otherwise, we 
exchange the roles of urns i and j. 
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Recall that 



< 



< 



and therefore, for each t, s[ < |s( < |. Indeed, let {a, c} 



{1, 1/2, 0} be such that p„ ^ < < pc,f- If one lets p'i ^,py^ t^P'o,t probabihties that voter t, with 

unknown urn j, will, respectively, vote for i, for an urn other than i and j, and for urn j, then we have 
thatp'i^^ < fpi,t,p'i/2^t < f pi/2,t and p^^^ < |po,t- Therefore p;,^^ + p^,^^ < l{pa,t + Pb,t)- It follows that 
s't = l- p'^^t = p'a^t + p'^^ ^ < I {pa,t + Pb,t) = f st> which is upper bounded by | • Si < | • | = |. 

Observe that the sum s' of the s['s, s' = X^I^i ^t' ^^"^ at most | times the sum s of the sj's; that is, 

Let Xj- be the random variable that, if the unknown urn is j, has value 1 if the t-th voter votes for urn i, 
if she votes for urn j, and 1/2 otherwise; we have: 



Pr [Vt, X[ equals the value xt that maximizes Pr[X( = xt]] = JJ(1 — s't) 



t=i 



Using the same greedy algorithm as before, but moving mass x = min (| — sj) from couples of s['s 
such that s[, < I and s[ > 0, s^, 7^ s^, we get that the previous product is minimized when exactly 



distinct sj's exist having value 4/5, one having value < s' 
Then, 



37i 



I < |, and the rest having null value. 



Pr [Vt, equals the value xt that maximizes Pic[Xt = xt]] = — ■sj) 



> 5' 

_ 4 logs V~ 

>5 

> S-Nsr;-!]-! > ± 

- 25' 

Now, if urn i won with this sequence of votes, it follows that j cannot win. 

We have shown that if Var[X] < ^ logs V~^' then the probability of winning is at most 1 — We now 
assume Var[X] > ^ logg 77"^. We will use the following anti-concentration inequality (see Theorem 7.3.1 
in HH, and Hi) to finish the proof: 

Theorem 4.3 ( Il9l ll4ll ). Let X = Y17=i -^i' where Xi are independent random variables, with Xi G [0, 1], 



fori = l,...,n.Leta'^= Var [X] be > 40000. Then, for each t £ 



— 

100 



it holds that 



Pr [X > E[X]+t] > c - exp 



-) 



for some universal constant c > 0. 



We apply Theorem l4.3l on the random variable X = X{i,j), choosing t 
and t = otherwise. This choice is valid since 



64e7mVar[X] 



,if Ai(i) > 0, 



< 



t 



Var[X 

where the latter holds if m < 



< 



64e7 



' m ■ 



n3e-2Var[X] 



< 



4608e7 In 5 



m 



n'^e 2 In 77 ^ 



< 



100' 



1 



46080000e7 In 5 



n € 



'In 7] ^. 
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We also need Var[X] > 40000 to apply Theorem 14.31 Since Var[X] > ^ log5 r] ^, and rj < H, we 
choose HtoheH = 5-2880000^ obtaining Var[X] > 40000. 

Observe that E[X] = ^ - ^ ■ Ai{j). We show that the event "X > E[X] + t" impUes the event 
"X > y" (which directly implies that the unknown urn i will not win the election). 

If Aj(j) < 0, the claim is trivial, since then E[X] > ^, and t is non-negative. Otherwise, by the bound 
Var[X] > m ■ we get 



which proves that X > E[X]+t =^ X > ^. 
Applying Theorem l4.3[ we get 



Pr [X > E[X] + t] > c • exp 



c • exp 



3Var[X]y 



f 64 _i 

> c • exp • m r? 

V 3 • 46080000 • In 5 ' 

> c- r]. 

The proof is then complete. □ 



5 An Upper Bound for Many Signals 

In this section we consider the voting problem in its full generality: we have a set of n > 2 urns, with each 
urn i = 1, . . . ,n inducing a distinct probability distribution Pi = {pi,i,Pi,2, ■ ■ ■ ,Pi,c) over a set of C signals 
or colors. Let e be the minimum ii distance between the distributions Pi. 

C 

e = min^i(Pj, Pj) = minV" |pi,c - Pj,c\ ■ 

Observe that when C = 2, this parameter e is twice the one that we used in Section |2] 

Theorem 5.1. There exists a voting scheme that, using m = Q ^ (CiogC') n voters, guarantee that 

the unknown urn wins with probability at least 1 — r]. 

The proof of this Theorem spans two subsections (Sections 15.11 and \5?2\i . In Section |5TT] we generalize 
the bichromatic voting scheme so (a) to treat urns that are not "well-separated" as if they were the same - 
this virtually increases the separation parameter e - and (b) to guarantee, under some conditions, that the 
equivalent of the M parameter of the bichromatic voting scheme of Section[2]is not just upper bounded by 
0(n^e~^), but is actually asymptotic to 0(n^e~^). 

In Section 15.21 we use both these properties to devise a new voting scheme that uses the generalized 
bichromatic one as a black box. The main idea of the multicolor voting scheme is to force voters to view the 
urns as bichromatic ones: each voter will choose a color c at random, and consider each urn as a bichromatic 
urn with colors c, c — that is, she will imagine that there are only two colors: "c" and "any color other 
than c". Using this trick directly with the bichromatic voting scheme of section |2] would decrease to the 
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minimum distance between urns in the worst case. We do not want the separation between urns to decrease 
— since that would increase the minimum number of voters needed for the election to be successful — this 
is where property (a) of the generalized bichromatic voting scheme becomes pivotal. Also, we need a way 
to aggregate the votes given to each single urn, in each of the (c, c) bichromatic instances; this has to be 
done in a way that guarantees that the right urn will win with high probabiUty. We manage to do this by 
leveraging on property (b). 



5.1 A More Flexible Upper Bound with Two Signals 

To build a framework that can be used to handle the case of C > 2 signals, it is useful to consider a more 
general formulation of the bichromatic problem in which certain options can induce identical distributions 
over signals (and hence be indistinguishable from each other). We present the analysis in the language of 

urns and colored balls. 

Thus, suppose we have a collection of n urns, labeled pi for i = 1, 2, . . . , n. With a shght abuse of 
notation we let pi and I — Pi be, respectively, the fraction of blue balls, and of red balls, in urn pi. We 
assume w.l.o.g. that < pi < p2 < • • • < Pn < 1- 

We assume that one urn is adversarially chosen as the correct one (we will also refer to this as the 
unknown urn). Then each player draws a ball from the urn and votes for the name of an urn based on 
the color they observe. For this general version with indistinguishable urns, we will be interested in the 
probability that the urn receiving the most votes has the same distribution as the correct one; this general 
formulation is for the sake of the multi-color case later. 

We describe the strategy that the players will use to randomly choose which vote to cast. First of all, 
for some n' > 10, choose < p[ < P2 < ■ ■ ■ < p'^, < I. Let e = mini<j<„/_i (p^+i — p'j) . We require 

that (a) for 1 < A; < 

p'k+1 < (2^ + l)e andp;,_;^ > 1 - {2K + l)e 



= K it holds that p'^_^^ - p'^ < 2e and p'^'-k+i ~ P'n'-k - and (b) 



The p^'s are called the landmarks of the voting scheme. 

IT.;. ^ ^-(P'e+I+Pe) . P'i+i +P'e ^ , , / 

1. Let = 2^ — and rfe = 2^ , _ , for A; = 1, ... , n'. 

Pe+i Pe Pe+i Pe 

2. Let : {pi, . . . — {1, . . . , n'} be a mapping from urns to landmarks' indices, defined so that 

<p{P'i) = k if k maximizes pibj. + (1 — Pi)rk (ties can be broken arbitrarily). 

3. Then define R = ZlLi {{\4>~Hk)\ + l) • r^) and B = J2t=i ((|<^"H^)| + l) • h), and set M = 

max{R, B). 

4. The probabiUty that a voter will vote for pj if a blue ball is drawn is 

JP = Pj\ = ■ f 6^(p^.) + = Bj. 



P~/(blue) \ n 

5. The probability that a voter will vote for pj if a red ball is drawn is 
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It is easy to check that the two probabiUty distributions {Bi,B2, ■ ■ ■ , Bn) and R2, ■ ■ ■ , Rn) are 
well-defined (their values are non-negative and they both sum up to one). Observe that Bi = Bj and 

Ri = Rj if (p{pi) = 4>{pj). 

For a given urn pi, let kf be the smallest positive index such that p'^+ > pi, if such an index exists, and 
fcr be the largest index such that p' < pi, again if the index exists; observe that at least one of kf' and k^ 
has to exist since n' > 10. We show the following lemma: 

Lemma 5.2. For each i = I, . . . ,n, 4>ij>i) either equal to kf or to k^. If, for some i, we have pi = p'^ it 
follows that (p{pi) = kf = k^ = k. 

Proof. For an arbitrary k it holds that 

Pi-hk + {l- Pi) ■rk=Pi- y — 7 + {l-pi)- > y 

fc— 1 (, n' — 1 / . / n' — I I I / 

^sp _^P^ ^sp Pt+i±Pi _ sp Pi+1 + Pe 
e=i ^^+1 Pe e=k ^^+1 £=1 ^^+1 

The latter sum does not depend on k. Therefore pibk + (1 — Pi)rk is maximized with a A; that maximizes the 
total of the former two sums. 

Suppose that k^ exists and that k < k^ maximizes the expression; then, by increasing kto k + l < k^, 

we would remove the term ^f^^f^f from the second sum, and add the term / / to the first one. Since, 

Pfe+l Pk Pk+1 Pk 

by definition p'^ < ^P'k- — Pi' ^^^^ P'k+i + ^'fc < "^Pi' therefore the total value of the first two 
sums would increase. It follows that k < k~ cannot maximize the expression. 

Analogously, suppose then that kf exists and that k > kf maximizes the expression; by decreasing k 

to k — 1 > kf , 'we remove the term — from the first sum, and add the term to the second one. 

- * Pk-Pk-i Pk-Pk-i 

Since pi < p'f.+ < v'k~i < P'k'^^ obtain that 2pi < p'k_i+ p'k and therefore we increase the total value of 
the first two sums; thus k > kf does not maximize the expression. □ 

We now turn to computing the probability that a player will vote for pj given that the correct, adversari- 
ally chosen, distribution is pi'. 

Pr [P=pj]=p, Pr [P=p.] + (l-p.) Pr [P = pj] = p^Bj + {1 - Pi)Rj = Ei{j). 
X~pi ■' P~/(blue) ■' P~/(red) 

P~/(X) 

We compute the difference between the probabilities that a vote for urn p^ and a vote for urn pj are cast, 
given that the correct urn is pf. 

\{j) = Ei{{)-Ei{j). 

We will lower-bound Ai(j) to bound the number of voters needed to let the voting scheme be successful 
with high probabihty. 



Lemma 5.3. For each 1 < i, j < n, it holds that 

A,(j) > 



l0(Pi)-0(Pj)l jf _ f _ f 



max(|0(p,)-^fa)|-i,o) otherwise 
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Proof. We make the expression of Aj(_7) explicit: 

Mj) = Pi (Bi - Bj) + (1 - Pi) {Ri - Rj) = Pi — + (1 - Pi) — . 

Suppose first that (pipj) < k~; then 

M-Ai{j)=pi- — — [1-Pi)- 2^ - — — r = 2^ — — 

^ 2-pi-{p'^^^+p'^) ^ 2-p[^^-{p'^^^+p'^) ^ P^+i - P£ ^ _ , 

where the first inequality follows from <^(pi) > k~ and the second frompj > p' > p'o^^- 
If, instead, ^{pj) > kf we have: 

where the first inequality follows from (/>(pi) < k'l and the second frompj < + < p^. 
Also, recall that ^{pi) is an index that maximizes Pib(j)(^p.>j + (1 — Pi)Ttj,{pi)- Since 

Aj(j) = M"^ • + (1 -Pi)r^(p,)) - + (1 -Pi)r0(p.))) , 

we have that Ai{j) > for each ordered couple of urns i, j. The statement follows. □ 

We now give an upper bound on the probability that the correct urn will be chosen by a voter. Note, 
somewhat counter-intuitively, that the probability of a correct vote is higher when this upper bound is smaller 
— this is because the Aj(j) are additive gaps, not multipUcative ones, and so by making the upper bound on 
the expected number of votes for the correct urn smaller, the gap Ai{j) becomes larger relative to the mean. 

Lemma 5.4. It holds that 

^ , , 2(n' - 1) 1 

eM n 

Proof. Recall that the correct urn is pi. We upper-bound the probability that a vote will actually go to pi. 

Eiii)=pi.M-'.(^b^^p^) + ^^-^^+{l-pi).M-'-(^r^^^) + ^^-^^ 
< Pi ■ (^M-i • 6^(^,) + ^) + (1 - Pi) ■ . r^(^,) + ^ . 
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i-1 



^ 2 - {p'f,-. + p'f) 2(i - 1) 
Observe that, by the definition of e, we have that bi = > ^ — - — ; — is at most bi < , and 



that ri=Y_, — ^ < -■ Thus, 



Ti Pe+i-Pe 



e=i 



P'e+i - P'e 



Ei{i) < Pi 



2{n'-i) ^ 2in'-l) ^ 1 



eM n 



eM n 



□ 

We now give an upper bound on M. This will allow us to apply a Chemoff bound and prove the main 
theorem. 

Lemma 5.5. It holds that 

1 . {n'-l)-{n + n') ^ ^ ^ {ji' - \) ■ {n + n') 
81 e - - e ■ 

Proof. Recall that M = max(i?, B); we will upper bound i? + 5 to get an upper bound on M: 

n' n' 

R + B = Y^ {p'k) I + 1) • (rfe + bk)) < ip'k) I + 1) • + ^^n')) 



k=l 

n' 



k=l 



n'-l 



k=l 



e=i 



Pe+i - Pi 



It follows that 



M < 



2 • (ra' - 1) • (n + n') 



n'-l 



(6) 

K 



We now move on to the lower bound. Recall that n' > 10, and that, for each A; = 1, 2, ... , 

(resp., = n'-K,... , n'-l) we have < 2e; also, < (2iC+l)e andp^,.^ > l-(2ii:+l)e. 

Observe that 



2ir + 1 = 2 



n'-l 



+ 1 < 2 - + 1 < 2 < 2- 



TT-^ < ■ n . 



Since e = mini<j<„/_i p'-_^^ - p'-, we have e < and 

/ N 5 n' 25 
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We are now ready to lower bound M: 

n' 

M = ms^(R,B)>^±^ = ^.J2{{\r\k)\ + l)-irk + h)) 

k=l 

n'—l I I / \ 1 n' / fc— 1 



n'-l 



fc=l \ €=max(fc,n'-i<:) ^^+1 

1 / ™n(fe-i,i<:) , , 

^g((rH*)l-)- E -i^' 
>- ^ E (k-wKD^ E ?<i^<f±M), 

k=l \ i=max.{k,n'-K) 

^E((|rH.)Ki)."E"^-^^). 



using the upper bound we previously obtained for {2K + l)e, we obtain: 

, n' ( n'-l \ -I n' / min(fc-l,K) 

M > l-Y.\i\r'm + i)- E 1^ +^-E + E I 

k=l \ e=miL^{k,n'-K) / k=l \ £=1 

1 

^ ^ • E + 1) • - - ^) + - 1, i^))) 

^ fc=i 

1 

fc=i 

observe that (n' — /c) + (A; — 1) = n' — 1 and therefore at least one of (n' — A;) and A; — 1 is at least 

'n' - r 



n'-l ^ |n^- I _ n' + ^ ^ n' + 7/2 ^ 



= K. 



Therefore, at least one of min(ra' — k, iC), min(fe — 1, i^) is at least K. Then, 

^ ^ 2k (d^'^^^l + 1) • ^) = ^ -E (k-^(^)l + 1) = 1^ • + -0 > 

k=l k=l 

□ 



Therefore, going back to the probability that an urn identical to the correct urn is voted for, we have 

^ , , 2(ra' - 1) 1 2{n' - 1) 162 



eM n ~ J_ . {n'-i){n+n') n + n' 

81 e 
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5.2 An Upper Bound for Many Signals 

In this section we consider the voting problem in its full generaUty: we have a set of n > 2 urns, with each 
urn i = 1, . . . ,n inducing a distinct probability distribution Pi = {pi,i,Pi,2, ■ ■ ■ ,Pi,c) over a set of C colors. 
Let e be the minimum ii distance between the distributions Pj: 

c 

e = m.m£i{Pi,Pj) = min |pi,c - Pj,c| • 

c=l 

It turns out that the bichromatic scheme from Section ISTT] has already laid much of the groundwork for 
the multi-color case. Each voter u will behave as follows: 

1. First, u will choose a color c = c(n) uniformly at random from among all the colors. Voter u will 
then imagine the urns as inducing a bichromatic instance by imagining all colors other than c as a 
single color c. In this way, urn i becomes a bichromatic urn with distribution {pt^c, 1 — Pi,c) over its 
two colors. 

2. Then, voter u will choose an integer t = t{u) G {0, 1, . . . , T} with T = \log^ C~\ + 1, in such a way 
that Pr[t = i]= a'^ ■ 3"'^+*, where a = Ylf=o 



Observe that a < J2i^o 



3. Voter u will then apply the bichromatic voting scheme from Section [5?T] to choose which urn to vote 
for. She will set {pi,P2, ■ ■ ■ ,Pn} = {pi,c,P2,c, ■ ■ ■ ,Pn,c}, for i = 1, . . . , n; the sequence of the p-'s 
will be defined as follows: 

- first she will pick a subsequence according to the following marking algorithm: set wi = and 
mark all the pj's such that pj < 3^* • e; if some unmarked pj remains, let i = 2, and 

- set Wi to be the smallest unmarked pj, 

- mark all the pj's for which \pj — Wi\ < 3~* • e; 

- if some unmarked pj remains, repeat; otherwise, if Wi ^ 1, set tfj+i = 1; then, stop. 

4. let i* be the length of the sequence {wi}; if i* < 10, the voter will add 10 — i* elements to {wi}: 
let i be such that Wj+i — Wi is maximized; the voter will insert the values the values Wi + ^{wi+i — 
Wi),'Wi + l{wi+i -Wi),...,Wi + - Wi) in the hst, keeping it sorted. 

The size of the sequence {wi} will then be at least 10. 

The voter will then define the sequences Xi^i,Xi^2 as Xi^i = ?l^hd^£i+l^ Xi^2 = '^^h+2^h±L^ for i = 

5. the voter then merges the sequences Wi, Xj^i, 2, and sorts the resulting sequence increasingly; let 

yi < 2/2 < • • • < ysi* -2 be this sequence, and ec,t be its separation parameter: ec,t = T^i^i=i,...,3i*-3yi+i- 
Vi- 

6. then the voter adds elements to {yi} in such a way that: 

(a) the minimum separation between adjacent elements remains at least ec,t, 



(b) if the list has length n', then for each i = 1, . . . 



, it holds that yj+i — yi < 2ec,t, and 



(c) if the list has length n', then for each i = ^ , . . . , n' — 1, it holds that yi+i — yi < 2ec,t- 
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To do so, she applies the following algorithm: 

6.1 if (b) is not satisfied, i.e., if the list {yi} has currently length n' and i is a minimal index i for 
which there exists elements yi,yi+i such that i < y and yi+i — yi > 2ec,t, then insert a new 
element between y^ and y^+i of value yi + ec,u this will increase the length of the list; repeat 
this step as long as (b) is not satisfied; 

6.2 if (c) is not satisfied, i.e., if the list {y^} has currently length n' and there is a maximal index i for 
which there exists elements yi, y^+i such that i > ^ and yi^i — yi> 2ec,t, then insert a new 
element between yi and yj+i of value yj+i — ec,u repeat this step as long as (c) is not satisfied. 

It is easy to prove that the above algorithm guarantees properties (a), (b) and (c). Let n'^ ^ be the length 

fn' 1 

of the final sequence {yi}, K^^t = -5^ > and observe that the algorithm also guarantees that (d) 

<t > 10, (e) < i < 3r - 2 + 2{Kc,t + 1) < Oi* + 2 < 9n + 2 and (f) yK,,,+i < {2Kc,t + l)ec,t 
and yn'^ ,-Kc,t ^ 1 - (2^c,t + l)ec,t- 

The just-defined bichromatic instance depends only on the original multi-colored instance, on c and on 
t — we use (c, t)-instance to refer to the bichromatic instance induced by c and t. 

Observe that the separation parameter ec,t of the (c, t)-instance will be at least e^t > 3^*~^ • e, since 
the Wi's are at distance of at least 3^*^^ • e from each othei[^ and Xi^i, Xi^2 split the interval between Wi and 
Wi+i in three equal parts — the subsequently added y^'s do not induce gaps smaller that ec,t- Furthermore 
the number of landmarks of the (c, t)-instance will be 10 < n'^ ^ < 9n + 2 < lOn, since n > 2. 

We are now ready to prove Theorem 15. 11 using the machinery built in Section [STT] 

Proof of Theorem 15.71 First, given two urns Pi , Pj , we say that a color c is useful for i,j i^\pi^c — Pj,c\ > gfj • 
Observe that if Cij is the set of useful colors for urns Pi, Pj, we have 

X] \Pi,c-Pj,c\ > ^ ■ 

Indeed, since there are only C colors, the contribution to the £1 distance between Pi and Pj of their non- 
useful colors is less than C ■ = |. Given that the total distance is at least e, it follows that the useful 
colors contribute by more than y to the li distance of Pi , Pj . 

Suppose that i is the unknown urn. Let pi = pi^c and pj = pj^c in the (c, -bichromatic instance, for 
some c, t. Let E^^'^^ (j) be the expected number of votes that a voter will give to urn j, if i is the unknown 
urn, in the (c, t) -bichromatic instance. The analysis of the bichromatic case, guarantees that 

^(ct), N ^(ct), N 162 162 



n + n\,^ n 



and that the difference = -^f - is at least 

Ac,t), ^ max(|0(p^) - (t){pj)\ - 1,0) 

Fix a color c G Cij and let tc,i,j be the smallest non-negative integer such that 

\Pi,c-Pj,c\ >e-3-^-''-^. 



Before step 4 they were at distance at least 3 *e from each other, and step 4 could have added new Wi's at distance at least 
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Since c is a useful color we have |pj c ~ Pj,c\ > and therefore < tc,t,j < [logs + 1 = T. By 
Cc t ^ 3^*^^ • e, we obtain 

1 

^c,tc,!,j ^ ^ \Pi,c ~ Pj,c\ ■ 

Since |pj — Pj\ > e • 3~*'^'»'J , the marking algorithm run by the voters will mark pi and pj at different 
iterations — therefore, there are at least three landmarks between andpj. Itfollows that — > 

2. Therefore, 



Af ''(i) > 



Met - 



> 



> 



2 • 



K.t,,.^,-i)-("+<.t,,.,,) - 22n« 



> 



C,tn 



\Pi,c Pj,c\ 

17820 • n2 



Then, 



^'0) = EE 4 ■ ^'"'<^>) ^ ttt ■ E , A<«)o) > ■ E 

c=i t=i ^ ^ cedj cec'i i 

\Pi,c Pj,c\ ^ £ 



(J) 



1 

C-T 



E 

cgCi., 



and 



17820 • n2 - 26730 • C • T • n2 



-.«)^EE(i^-.'-'(i)).^ 

c=l t=l ^ ^ 



We choose m 



e 



- — In - I as the number of voters, and we 



V \ V '= ^ 

apply Chernoff bound (see Theorem 12.21 ). on Vj: the number of votes to urn j in the election, if i is the 
unknown urn. Observe that E[Vj] = m ■ Ei{j), and furthermore: 



Pr 



\V,-E[V,]\>--m 



= Pr 

< 2exp 

< 2exp 

< 2exp 

< 2 exp 



2 



X 



27Et{j) 

„2 



X 



27 E,{j) 



^E,{j) 
■ Ei{j) ■ m 

m 



Ei{j) ■ m 



27 • 267302 . C2 • r2 • n4 . 162 

n 



■ m 



e \ ^ V 

* TTl I — 

27 • 267302 • C2 • T2 • • 1^ - n' 



Applying the Union Bound over all the urns, we have that each single urn j will deviate by at most |m 
from its expected number of votes with probability at least I — rj, and since the expected difference of the 
number of votes of urn i and j if i is the unknown urn is at least m • Aj(j) > m • x, we have that urn i will 
win the election with probabihty at least 1 — rj. □ 



6 Other Voting Systems 

In this section we study other important voting systems, assuming that there are two types of signals; that is, 
assuming bichromatic urns. 
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6.1 Cumulative Voting 



We show that cumulative voting requires a smaller number of voters for the election to succeed with high 
probability. In fact, cumulative voting can be exploited to work with a number of voters as small as the 
number of samples used by the optimal centralized algorithm (that is, the algorithm that, after sampling the 
minimum number of balls, produces the right guess with high probability). 

Like plurality voting, in the cumulative voting election system, each voter has a single vote to cast; 
unUke pluraUty voting, though, the voter can spUt her vote arbitrarily between the candidates: 

Definition 6.1 (Cumulative Voting). Each voter assigns a score to each candidate, in such a way that no 
score is negative and the sum of the scores assigned by a voter is 1. The total score of a candidate is the 
sum of the scores assigned to that candidate by the voters. If there exists a candidate i having a total score 
larger than the total score of each other candidate j ^ i, then i is the winner of the election. 

Given urns pi,p2, ■ ■ ■ ,Pn, voting scheme we propose for cumulative voting is directly derived from 
the plurality voting scheme we proposed earUer; in the new scheme, there are only two possible votes: if a 

voter picks a red (resp., blue) ball then she will vote i?2, • • • , Rn) (resp., {Bi, B2, . . . , Bn)) — that is, 
she will assign a weight of Ri (resp., Bi) to candidate Pi, for i = 1, 2, . . . , n. The Ri's and the Bi's are 
those that we defined for the plurality voting scheme. 

Theorem 6.2. Let urns pi,p2, ■ ■ ■ ,Pn be given, with urn pi having a pi fraction of blue balls, and al — pi 
fraction of red balls. Let p\ < p2 < ■ ■ ■ < Pn- Also, let e be e = mini<j<„_i(pj-(-i — pi). Then, for 

Cumulative Voting, O ^e~^ In voters are sufficient to guarantee a probability of at least 1 — r] that the 
correct urn wins the election. 

Proof. Choose some r/ G (0, 1), and suppose the number of players is 



m 



9 2' 
150 • • In - 



We start by using the Chernoff bound to show that the number of voters that pick a ball of some color is 
concentrated. If > 5, let X be the number of voters picking a blue ball; otherwise let X be the number of 
voters picking a red ball. In both cases, X is the sum of iid binary random variables Xj, with Xj = 1 with 
probabiUty max(pj, 1 —pi) and Xj = with probabiUty min(pj, 1 —pi). Then, E[X] = m ■ m£Lx(pj, l—pi) 
and f < < m. By Chernoff bound, 

Pr \X - E[X]\ > - • ml < Pr - E[X]\ > - ■ E[X^ < 2 • exp (-— ■ E[X]\ 

m\ _ /. 2 



that is, with probability 1 — ry the absolute difference between the number of blue (resp., the number 
of red) balls picked and the expectation pi ■ m {{1 — pi) ■ m) by al most |m. 

For any j G [n], i 7^ j, let Vi{j) be the fractional number of votes to Pj in the random election, with 
unknown urn P^. Let Di{j) = Vi{i) — Vi{j). Observe that urn i beats urn j in the election (with unknown 
urn i) iff Di{j) > 0. The random variable Di{j) is the sum of m iid random variables D[{j) each taking 
value Bi — Bj if the corresponding voter picked a blue ball and Ri — Rj if she picked a red ball; we now 
bound the span S of values of D'-{j) — that is, we bound S = \{Bi — Bj) — {Ri — Rj)\. Observe that 
Bi -Bj = ^-^ and Ri - Rj = Also, 

^ _ ^ ^ 2 - jpe+i+pe) _ 2 - {pi+i + pe) 
^ Pe+i-Pe ^ Pe+i-Pe 
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and 

n—l n—l 

^. _ ^. ^ Pi+i +Pi _ Pe+1 + Pe 
^ jr^^Pw-Pi jr'jPw-Pi' 

Therefore i > j iff bi — bj > and r j — rj < — which implies that — bj) — (ji — r j)\ = \bi — bj\ + 
\ri — Vjl and thus the span S of D'-{j) is equal to S = \Bi — Bj\ + \Ri — Rj\. Furthermore, 

max(i,j) — 1 



„ ..... mi e 

4:=mm(j,j) 

Analogously, \ri — rj\ < i^^. It follows that the span of -D'(j) can be upper bounded by 



I — Bj I -|- I Ri — Rj I ^ 



Observe that Di{j) is a linear function of the number ni}, of blue balls picked (and the number rrir = 
m — rub of red balls picked): 

= rrib ■ (Bi - Bj) + • (Ri - Rj). 

Therefore, 

E[Di{j)] =m-pi-{Bi- Bj) + m-{l-pi)- {Ri - Rj) = m ■ Ai{j), 

where Aj(j) is the functional defined in Section[2l recall that we proved there that Aj(j) > ^57^. 

Recall that with probability 1 — r], \mb — m ■ pi\ < f • "i. If this event happens we have that, for each 

D.U) > Emm - I - m- S > m - (^\U) - ^ ■ = " > 

Therefore, urn i will beat each urn j / i, with probability 1 — tj. The proof is concluded. □ 

Observe that the previous bound is tight in a strong sense: no algorithm that picks o ^e^^ In balls, and 
produces a guess arbitrarily after having seen all their colors, is able to guess the right urn with probability 
at least 1 — rj. 



6.2 Condorcet Voting 

In this section we present a conjecture, and we elaborate on it, with the aim of showing that Condorcet 
voting is as good as Cumulative voting — and is thus optimal. We begin by recalling the definition of the 
Condorcet voting system: 

Definition 6.3 (Condorcet Voting). In a Condorcet election, each voter returns a (total) ordering of the 
candidates. Given two candidates i and j, we say that i beats j in a run-off election if more than half the 
voters ranked i higher than j. If there exists a candidate i that beats each other candidate j ^ i in a run-off 
election, then i is the winner of the Condorcet election. 

We observe that, in Condorcet voting, voters do not assign real numbers to candidates as in Cumulative 
voting — they rather return a discrete object: a permutation of them. 

There exist many variants of the Condorcet election. The differences between them lie in the way of 
dealing with ties (that is, when no candidate i beats each other candidate j in a run-off election). Our 
main theorem holds for any such variant, since our theorem will guarantee that no ties will exist with high 
probability. 

We start by defining a set of coefficients that will be useful for introducing a Condorcet voting scheme. 
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Definition 6.4. For k > and £ > 0, let Ck e be: 



Ck,e = (-1) 



k+e 



k + i 
k 



k + e + 2 



We define inductively the sequence b^/, indexed by two integers; if k < or i < then 6^ , 
notational simplicity we will use: 

k 



0; for 



Xkl 



E 

i,i>0 
i+j<k 



bi,k-i-j • bj/ 

{i + i){i+j + 2y 



Vk, 



i=0 



EM 



\k—i 



hi 



i + l 



Then, for k>Q,l>Q,we define b^^i = {k + 1) ■ {xk-i/ + Vk^i/ - Ck+i,e). 

The induction is well-defined: x^-i/ only depends on the 6j j's for which either (a) z + j < — 1, or 
(h) i < k — 1 and j = I; yk-i/ only depends on the bij's for which i < k — 1 and j = £. Therefore, the 
bk/s can be computed in the following order via the recurrence: for n = 0, 1, 2, . . . and for A; = 0, . . . , n, 
compute bk,n-k- We now make a conjecture on the bk/s- 

Conjecture 6.5. Let B{x, P) = X^^q X^^o {bk/ ■ ■ P^) . Then, 

(i) the series B{x,P) converges for 0<x<^, x<P<^, 

(ii) B{x, P) > Ofor < x < I, x < P < ^, and 

(Hi) B{x, P)dx = -p^- J^, forQ<P<\. 



Comments on Conjecture \6.5\ We now make some comments on the conjecture, indicating some possible 
approaches to settle it. 

(i) Numerical approximations indicate that B{x,P) converges for each 0<x<-^, x<P<^, but it 
diverges for x = P = ^. 

(ii) For A; > 0, and < £ < />:, let 



E 

i=0 



k + 2 
i 



■ bk f- 



'k,i-i , 



also, let 



Bi{x,P) 




By looking at the first few terms of Bi{x, P)'s Taylor expansion, it appears that B{x, P) = Bi{x, P). 

The Bi (x, P) expression, if Bi{x,P) = B{x, P), could be quite useful to prove non-negativity, since 
the afc f's seem all to be non-negative — if they are point (ii) of the conjecture directly follows. 

Also, if Bi{x, P) = B{x, P), then one can express bk^e (for each ^ > 0) in terms of bk.o, b^^i, ■ ■ ■ , b^^k- 

k 



bk,, 




h 




where [^] represents the unsigned Stirling number of the first kind with indices n > k. The last claim 
can be proved using the Bi{x, P) expression and the following expression for [Q + 1)^*, t > 0: 



1 



{Q + iy 



E 

i=0 



(-1)' 



E 

j=0 
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e = o 


e = i 


i = 2 


£ = 3 


£ = A 


i = 5 


k = 


1 


-2 


3 


-4 


5 


-6 


k = 1 


2 


-2 





4 


-10 


18 


k = 2 


3 


-8 


18 


-36 


65 


-108 


k = 3 




44 


20 


44 
3 


4U 
3 


80 


k = 4 




8I4 


53 


388 
3 


880 
6§8 


-610 


k = 5 


5 


15 


5 


-188 


3 


558 
5 



Table 1: The first few 6a:,£'s- 





i = 


£ = l 


^ = 2 


£ = 3 


i = A 


i = 5 


k = 


1 












k = l 


2 


4 










k = 2 


3 


4 


4 








k = 3 




56 


4U 
3 


16 

3 






k = 4 


3 


3 


50 


106 

3 


32 
3 




k = 5 


98 
5 


1214 
15 


2U12 
15 


656 
5 


1U16 
15 


46 

3 



Table 2: The first few ak/s. 



(iii) Since 



p + i y 1 + 2P 

n=0 



with 



r - j ^-CliD if^i^even _ l-3-(-ir fn^M]^. 



2Cl:j:l)-l if n is odd 2 VhA 

we have that the point (iii) of the conjecture states that, for n > 0, 

n , 

EOk,n-k _ ^ 



k + l 

k=0 

Tables [T] and [2] show how the bk/ and the ak/ sequences begins. 

□ 

We use Conjecture [63] to show the existence of a probability distiibution over permutations that induces 
a given set of "marginals". 



Lemma 6.6. Let < pi < p2 < • • • < Pn < 1- Then, if Conjecture 16.51 is true, there exists a probability 
distribution over the symmetric group Sn, such that, for each 1 < i < j < n, 

Uij = Pr [TT{i) < 7r(j)] = min ( 1, 3- 

V Pi+ Pj . 

Proof. To each probability p G [0, 1] we associate a random variable Xp with values: 
• if P < ^ , Xp is the constant random variable p, with a point mass 7p = 1 at p\ 
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if p > |, Xp has a point mass of < 7p < 1 at p; Xp has a density function fp{x), with fp{x) 
p{x) for X e [l — p, I] and fp{x) = (3p{x) for x G (^)P)> with 



a 



ap{x) = {p + x) ^, 



and 



By Conjecture I6.5[ we have that Pp{x) > and fy^ Pp{x) dx = ijpp^ — y | — 1; therefore the total 
probability mass assigned to Xp is 1 — Xp is then a well-defined random variable. 
The CDF associated to ap{x), for p > ^ and 1 — p < y < ^ is: 

^ I . , y-i+P 
i-p y + p 

The CDF associated to j3p{x), for p > \ and ^ < y < p is: 



ry CO CO / -I 



Observe that ifpy^q, then there's zero probability that Xp = Xg — since, for each p, Xp has a positive 
point mass only at p. Then, we pick a permutation vr of {1, 2, . . . , n} by letting 7r(i) < 7r{j) iff Xp- < Xp.. 
We verify that the marginals Ilj j of our distribution satisfy the requirement in the claim: 

• ifO<p<q<l — p, then: 

Pr [Xp <Xg] = l 



• if < p < ^ and 1 — p < g < 1, then: 

Pr [Xp < Xg] = Pr[Xg>p] = l- r {q + x)'^ dx 

Jl-q 



p + q 



• if i < p < g < 1. Let Pi = Pr [Xp < Xg < P2 = Fr [Xp < ^ < Xg] , P3 = Pr [^ < Xp < p < 
and P4 = 'Pr[l<Xp<Xg<p]. Then, 

Pr [Xp < Xg] = p^ + p^ + p^ + P4. 
We now show that Pi + P2 + ^3 + ^^4 = -p;:^, completing the proof. We start by computing Pi and P2: 

Pi= {{Q + ^y'^- / {p + y)~^dy) dx = / - — ■ — — ■ — rdx 

Ji-p\ Ji-p J Ji~p[Q + xyip + x) 

1 (2p+l)(l-p+g) 

2p - 1 2?+T 



{q-p){2q + l) {q-pf 
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i-p ' \ Ji-g ' ) 2p+l V 2g + iy 2p+l 2q + l 

As for P3 and P4, we have: 



We show that for our definition of B{x, P) = /3p_|_i/2 + V^)' the equation Pr [Xp < Xq] = Pi + 
P2 + -P3 + P4 is equal to Pi + P2 + -P3 + A = for each ^ < p < g < 1. 

The latter is true iff P3 + P4 = — Pi — P2; expanding the terms, we get the equation holds iff: 



2p+l \2q+ „_ , „- . 

\ 2 / 2\ 2 

1 2p - 1 , in 2i+i 2p - 1 



+ 



or, equivalently. 



p + q iq-p){2q + l) {q - pf 2p + 1 2q + l' 



f3,j{x) ■ Pp{y)dy \ dx - ^^^^ ■ (3q{x)dx = 

1 _^ 2q+l 2q - 1 



p + i? {q-pf -?')(2g + 1)' 

For notational convenience, we let P = p — ^ and Q = q—\. Let P(P, Q) = Pq{p) = /^q+i (-P + 5)- We 
thus need to prove: 

^ [b{x,Q)- Biy,P)dy^dx-^^- B{x,Q)dx = 

+ 777^^^ , ^. (7) 



P + Q + 1 (Q-P)2 (g + l)(Q-P) 

Since we will expand the right-hand side in a Taylor series around [Q, P) = (0, 0), we observe that the 
right-hand side has a removable singularity at P = Q. Indeed, letting P = Q — e, 



e-^o \ e2 (Q + l)e / €^0 \ e2 (Q + l)e 

e ef_ _ Q^e^ ^/ 3a _ 

Q+1 Q+1 2(Q+TF ^ '^^^ Q 



!™ol £2 (Q + l)e 

€-i>o e 



2 



1 



Q + 1 2(Q + 1)2' 
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since the limit from the left and the right coincide. Therefore, if P = Q, the right-hand side of Equation|7]is 

1 1 



Recalhng that 



we get that the following holds: 



2Q + 1 Q + l 2(g + l)2- 

oo oo 

BiP,Q) = ^^(bk,e-P'-Q'), 



e=o k=o 



/ (b{x,Q)- / B{y,P)dy]dx = Y,Y. 
Jo \ Jo J ^^ofc^2 



-j • 



. ij>0 
\i+j<k-2 



{i + l){i+j + 2) 



P^ ■ 



J 



e=0 k=2 



Also, 



and, sincep^ = Er=o((-l)'-^')> 



l=Q k=l 



b^.pk.Qi. 



P 



^ „p OO OO "fe — 1 / 

''^ 1=0 k=l \.i=0 ^ 



P'' -Q^ 



e=o k=i 



Furthermore, we have 



In 



(Q-P+1)(P+1) 
1 



P + Q + l 
Q 



^^ci{k,£)-P'^-Q' 

k=o e=o 



with 



k=0 1=0 

k 



ci{k,l) = {-lf+' 



k + e + 2 

observe that c^/ = ci{k, I) + C2(fc, tj, where c^^t is as in Definition 16.41 

We then have that Pi + P2 + Pz + Pa = (particularly. Equation |7]l is satisfied iff 

00 00 00 00 

E E ((^^-2/ - y^-i,d • • Q') = E E (^M • ■ Q') ■ 

i=0 k=0 i=0 k=0 

The equality holds iff for each fe, £ > 0, we have 

Ck,£ = Xk-2,i — Uk-l/- 
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Observe that if = 0, then the equation is satisfied, since cq/ = x^2,e = y-i,e = 0. If A; > 1, then 
Uk-i/ = ~ yk~2,e- Since we defined the b/^/s to be, for for each k,£ > 0, 

bk,i = (k + l) ■ {xk-i/ + yk-i,£ - Cfc+i/) , 

we have that Equation His satisfied and -Pi + ^^2 + ^3 + ^4 = D 

Using the previous distribution over permutations we can prove the main theorem of the section. 

Theorem 6.7. Let < pi < P2 < ■ ■ ■ < Pn ^ ^ be the blue-probabilities of a set of bichromatic urns. 
Let e = mini<j<„_i(pj_|_i — p.i). If Conjecture 16.51 is true, there exists a symmetric voting scheme for the 

Condorcet election that guarantees that the unknown urn wins with probability I — t] with O (^^^ — ) voters. 



Proof. Lemma |6]6] guarantees the existence of a probability distribution P over the set of permutations of 
{1,2,..., n} such that, for each 1 < i < j < n, Pr7r^p[7r(i) < 7r(j)] = min ^1, p.^p . ^ ■ If 7r(j) > 7r(i) 
we say that j beats i in vr. 

We also let qi = 1 — pi\ therefore < < < ■ ■ ■ < qi < I, and mini<j<„_i((7j+i — qi) = e. 
Lemma 16.61 again guarantees the existence of a probability distribution Q over the set of permutations of 

{1, 2, . . . , n} such that forn > i > j > 1, we have Pr7r~Q[vr(i) < vr(j)] = min ^1, 

Each voter will apply the following algorithm: if she draws blue, she sample a permutation according to 

P, otherwise she samples a permutation according to Q. 

Now, suppose the i-th urn is the unknown urn. Let j ^ ih& the index of any other urn. If j > i, 

Pr[the unknown urn i beats another urn j] = pi Pr [7r(i) > vr(j)] + (1 — Pi) Pr > 7r(j)] 

f 1 \ ^1 

= Pi max 0, 1 + qi min 1, 



Pi +PjJ \ qi+ Qj . 

if Pi + < 1 (and therefore qi + q^ > 1) the latter simplifies to ^ ; otherwise pi + pj > 1, + qj < 1, 

and the expression simplifies to p, (l - ^) + qi = p^ - ^ + I - Pi = 1 - ^ = 
Therefore, if j > i, we have 



Pr[the unknown urn i beats another urn j] G 



Pj 



Qi + Qj Pi + Pj 



If, otherwise, j < i, we have 



Pr[the unknown urn i beats another urn j] = pi Pr [7r(z) > vr(j)] + (1 — Pi) Pr [7r(i) > vr(j)] 

= Pimin(l, ] )+5jmax(0, 1 

\ Pi+PjJ V Qi + 1j . 

if qi + qj < 1 (and therefore pi + pj > 1) the latter simplifies to p.'^p . ', otherwise qi + qj > l,pi +pj < 1, 
and the expression simplifies to p, + q^ (l - ^) = 1 - q^ + q^ - ^ = 

Therefore, in any case, Pr[the unknown urn i beats another urn j] G | ™ g^^g '^^ ^ , — ^-^^^|- We lower- 
bound the latter two fractions: 

ma^{pi,Pj) _ max{pi,pj) _ max(pi,pj) - ^\pi - Pj\ \pi - Pj\ 



Pi + Pj 2 max(pi ,Pj) - \pi - Pj\ 2 max(pi ,Pj) - \pi - Pj\ 2 2 max{pi ,Pj) -\pi- Pj \ 
Pi+Pj J ~ 4 ' 
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and, analogously, 



Qi + Qj - 2 4 



1j\ 



Since, \qi — qj\ = \pi — Pj\, we have 

Prfthe unknown um i beats another um 

L -^^-24-24 



Now, given two ums i, j, let Xi (j) be the random variable counting the number of votes in which i > j, 
with m voters. Observe that if Xi{j) > then um i beats um j. Also, 



TTl /I \% — 7*6 

- < m • ( - + J ^ ) < E[Xi{j)] < m, 



and 



Pr 



\X,{j)-E[Xiij)]\>^-^-^.m 



< Pr 



- E[X,{j)]\ > . E[X,ij)] 



<exp[- l' ' . E[Xiij)] 



< exp 



75 

■|2 2 

150 



m 



By choosing m 
Pr 



150e-2 In ^ 



, we obtain: 



X,{j)-E[Xi{j)]\>^-^^.m 



< exp ( — |i — In 



Observe that if \Xi{j) - E[Xi{j)]\ < • m, then — by E[Xi{j)] > m ■ (^^ + — we get 

> rn- (^5 + ^^20^^ ) T' which imphes that um i beats um j. 
Applying the Union Bound over all the ums j ^ i, we obtain 

OO , / 

Pr[um i does not win the election] < 2 • (^^ j ~ ^ ' i — ^TT — ^' 



□ 
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